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5 <110> APPLICANT: Wright, David A. 

7 Voytas, Daniel P. 

II <120> TITLE OF INVENTION: Plant Retroelements and Methods Related Thereto 
15 <130> FILE REFERENCE: P-1065 ISURF Plant Retroelement 

19 <140> CURRENT APPLICATION NUMBER: 09/965,553 

20 <141> CURRENT FILING DATE: 2001-09-27 

22 <150> PRIOR APPLICATION NUMBER: 09/322,478 

23 <151> PRIOR FILING DATE: 1999-05-28 

27 <150> PRIOR APPLICATION NUMBER: 60/087125 
29 <151> PRIOR FILING DATE: 1998-05-29 
33 <160> NUMBER OF SEQ ID NOS : 41 
37 <170> SOFTWARE: Patentln Ver . 2.0 
41 <210> SEQ ID NO: 1 
43 <211> LENGTH: 18 
45 <212> TYPE: DNA 
47 <213> ORGANISM: Glycine max 
51 <400> SEQUENCE: 1 

53 tggcgccgtt gccaattg 18 
57 <210> SEQ ID NO: 2 
59 <211> LENGTH: 18 
61 <212> TYPE: DNA 
63 <213> ORGANISM: Glycine max 
67 <4 00> SEQUENCE: 2 

69 tggcgccgtt gtcgggga 18 
73 <210> SEQ ID NO: 3 
75 <211> LENGTH: 6 
77 <212> TYPE: DNA 
79 <213> ORGANISM: Glycine max 
83 <400> SEQUENCE: 3 

85 ttgggg 6 
89 <210> SEQ ID NO: 4 
91 <211> LENGTH: 7 
93 <212> TYPE: PRT 

95 <213> ORGANISM: Artificial Sequence 
99 <220> FEATURE: 

101 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 
103 retroelement sequence 

107 <400> SEQUENCE: 4 
109 Met Ala Ser Arg Lys Arg Lys 

III 1 5 
117 <210> SEQ ID NO: 5 
119 <211> LENGTH: 1263 
121 <212> TYPE: DNA 

123 <213> ORGANISM: Artificial Sequence 
127 <220> FEATURE: 

129 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 
131 retroelement sequence 
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135 <4 00> SEQUENCE: 5 

137 atggcctccc gtaaacgcaa agctgtgccc acacccgggg aagcgtccaa ctgggactct 60 

139 tcacgtttca ctttcgagat tgcttggcac agataccagg atagcattca gctccggaac 120 

141 atccttccag agaggaatgt agagcttgga ccagggatgt ttgatgagtt cctgcaggaa 180 

14 3 ctccagaggc tcagatggga ccaggttctg acccgacttc cagagaagtg gattgatgtt 24 0 

14 5 gctctggtga aggagtttta ctccaaccta tatgatccag aggaccacag tccgaagttt 300 

147 tggagtgttc gaggacaggt tgtgagattt gatgctgaga cgattaatga tttcctcgac 360 

149 accccggtca tcttggcaga gggagaggat tatccagcct actctcagta cctcagcact 420 

151 cctccagacc atgatgccat cctttccgct ctgtgtactc cagggggacg atttgttctg 480 

153 aatgttgata gtgccccctg gaagctgctg cggaaggatc tgatgacgct cgcgcagaca 540 

155 tggagtgtgc tctcttattt taaccttgca ctgacttttc acacttctga tattaatgtt 600 

157 gacagggccc gactcaatta tggcttggtg atgaagatgg acctggacgt gggcagcctc 660 

159 atttctcttc agatcagtca gatcgcccag tccatcactt ccaggcttgg gttcccagcg 720 

161 ttgatcacaa cactgtgtga gattcagggg gttgtctctg ataccctgat. ttttgagtca 780 

163 ctcagtcctg tgatcaacct tgcctacatt aagaagaact gctggaaccc tgccgatcca 840 

165 tctatcacat ttcaggggac ccgccgcacg cgcaccagag cttcggcgtc ggcatctgag 900 

167 gctcctcttc catcccagca tccttctcag cctttttccc agagaccacg gcctccactt 960 

169 ctatccacct cagcacctcc atacatgcat ggacagatgc tcaggtcctt gtaccagggt 1020 

171 cagcagatca tcattcagaa cctgtatcga ttgtccctac atttgcagat ggatctgcca 1080 

173 ctcatgactc cggaggccta tcgtcagcag gtcgccaagc taggagacca gccctccact 1140 

175 gacagggggg aagagccttc tggagccgct gctactgagg atcctgccgt tgatgaagac 1200 

177 ctcatagctg acttggctgg cgctgattgg agcccatggg cagacttggg cagaggcagc 1260 
179 tga 1263 
183 <210> SEQ ID NO: 6 
185 <211> LENGTH: 421 
187 <212> TYPE: PRT 

189 <213> ORGANISM: Artificial Sequence 
193 <220> FEATURE: 

195 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 
197 retroelement sequence 

201 <400> SEQUENCE: 6 



203 


Met 


Ala 


Ser 


Arg 


Lys 


Arg 


Lys 


Ala 


Val 


Pro 


Thr 


Pro 


Gly 


Glu 


Ala 


Ser 


205 


1 








5 










10 








15 




209 


Asn 


Trp 


Asp 


Ser 


Ser 


Arg 


Phe 


Thr 


Phe 


Glu 


He 


Ala 


Trp 


His 


Arg 


Tyr 


211 








20 










25 










30 






215 


Gin 


Asp 


Ser 


He 


Gin 


Leu 


Arg 


Asn 


He 


Leu 


Pro 


Glu 


Arg 


Asn 


Val 


Glu 


217 






35 










40 










45 








221 


Leu 


Gly 


Pro 


Gly 


Met 


Phe 


Asp 


Glu 


Phe 


Leu 


Gin 


Glu 


Leu 


Gin 


Arg 


Leu 


223 




50 










55 










60 










227 


Arg 


Trp 


Asp 


Gin 


Val 


Leu 


Thr 


Arg 


Leu 


Pro 


Glu 


Lys 


Trp 


He 


Asp 


Val 


229 


65 










70 










75 










80 


233 


Ala 


Leu 


Val 


Lys 


Glu 


Phe 


Tyr 


Ser 


Asn 


Leu 


Tyr 


Asp 


Pro 


Glu 


Asp 


His 


2.35 










85 










90 










95 




239 


Ser 


Pro 


Lys 


Phe 


Trp 


Ser 


Val 


Arg 


Gly 


Gin 


Val 


Val 


Arg 


Phe 


Asp 


Ala 


241 








100 










105 










110 






245 


Glu 


Thr 


He 


Asn 


Asp 


Phe 


Leu 


Asp 


Thr 


Pro 


Val 


He 


Leu 


Ala 


Glu 


Gly 


247 






115 










120 










125 








251 


Glu 


Asp 


Tyr 


Pro 


Ala 


Tyr 


Ser 


Gin 


Tyr 


Leu 


Ser 


Thr 


Pro 


Pro 


Asp 


His 


253 




130 










135 










140 
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257 


Asp 


Ala 


He- 


Leu 


Ser 


Ala 


Leu 


Cys 


Thr 


Pro 


Gly 


Gly 


Arg 


Phe 


Val 


Leu 


259 


145 










150 










155 










160 


263 


Asn 


Val 


Asp 


Ser 


Ala 


Pro 


Trp 


Lys 


Leu 


Leu 


Arg 


Lys 


Asp 


Leu 


Met 


Thr 


265 










165 










170 










175 




269 


Leu 


Ala 


Gin 


Thr 


Trp 


Ser 


Val 


Leu 


Ser 


Tyr 


Phe 


Asn 


Leu 


Ala 


Leu 


Thr 


271 








180 










185 










190 






275 


Phe 


His 


Thr 


Ser 


Asp 


He 


Asn 


Val 


Asp 


Arg 


Ala 


Arg 


Leu 


Asn 


Tyr 


Gly 


277 






195 










200 










205 








281 


Leu 


Val 


Met 


Lys 


Met 


Asp 


Leu 


Asp 


Val 


Gly 


Ser 


Leu 


He 


Ser 


Leu 


Gin 


283 




210 










215 










220 










287 


He 


Ser 


Gin 


He 


Ala 


Gin 


Ser 


He 


Thr 


Ser 


Arg 


Leu 


Gly 


Phe 


Pro 


Ala 


289 


225 










230 










235 










240 


293 


Leu 


He 


Thr 


Thr 


Leu 


Cys 


Glu 


He 


Gin 


Gly 


Val 


Val 


Ser 


Asp 


Thr 


Leu 


295 










245 










250 










255 




299 


He 


Phe 


Glu 


Ser 


Leu 


Ser 


Pro 


Val 


He 


Asn 


Leu 


Ala 


Tyr 


He 


Lys 


Lys 


301 








260 










265 










270 






305 


Asn 


Cys 


Trp 


Asn 


Pro 


Ala 


Asp 


Pro 


Ser 


He 


Thr 


Phe 


Gin 


Gly 


Thr 


Arg 


307 






275 










280 










285 








311 


Arg 


Thr 


Arg 


Thr 


Arg 


Ala 


Ser 


Ala 


Ser 


Ala 


Ser 


Glu 


Ala 


Pro 


Leu 


Pro 


313 




290 










295 










300 










317 


Ser 


Gin 


His 


Pro 


Ser 


Gin 


Pro 


Phe 


Ser 


Gin 


Arg 


Pro 


Arg 


Pro 


Pro 


Leu 


319 


305 










310 










315 










320 


323 


Leu 


Ser 


Thr 


Ser 


Ala 


Pro 


Pro 


Tyr 


Met 


His 


Gly 


Gin 


Met 


Leu 


Arg 


Ser 


325 










325 










330 










335 




329 


Leu 


Tyr 


Gin 


Gly 


Gin 


Gin 


He 


He 


He 


Gin 


Asn 


Leu 


Tyr 


Arg 


Leu 


Ser 


331 








340 










345 










350 






335 


Leu 


His 


Leu 


Gin 


Met 


Asp 


Leu 


Pro 


Leu 


Met 


Thr 


Pro 


Glu 


Ala 


Tyr 


Arg 


337 






355 










360 










365 








341 


Gin 


Gin 


Val 


Ala 


Lys 


Leu 


Gly 


Asp 


Gin 


Pro 


Ser 


Thr 


Asp 


Arg 


Gly 


Glu 


343 




370 










375 










380 










347 


Glu 


Pro 


Ser 


Gly 


Ala 


Ala 


Ala 


Thr 


Glu 


Asp 


Pro 


Ala 


Val 


Asp 


Glu 


Asp 


349 


385 










390 










395 










400 


353 


Leu 


He 


Ala 


Asp 


Leu 


Ala 


Gly 


Ala 


Asp 


Trp 


Ser 


Pro 


Trp 


Ala 


Asp 


Leu 


355 










405 










410 










415 




359 


Gly Arg 


Gly 


Ser 


Glx 

























361 420 

367 <210> SEQ ID NO : 7 

369 <211> LENGTH: 1596 

371 <212> TYPE: DNA 

373 <213> ORGANISM: Artificial Sequence 

377 <220> FEATURE: 

379 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 
381 retroelement sequence 

385 <400> SEQUENCE: 7 

387 atgcgaggta gaactgcatc tggagacgtt gttcctatta acttagaaat tgaagctacg 60 

389 tgtcggcgta acaacgctgc aagaagaaga agggagcaag acatagaagg aagtagttac 120 

391 acctcacctc ctccttctcc aaattatgct cagatggacg gggaaccggc acaaagagtc 180 

393 acactagagg acttctctaa taccaccact cctcagttct ttacaagtat cacaaggccg 240 

395 gaagtccaag cagatctcct tactcaaggg aacctcttcc atggtcttcc aaatgaagat 300 



file://C:\Crf3\OutholcKVsrI965553.htm 



12/6/01 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/965 , 553 



DATE: 12/06/2001 
TIME: 11:36:05 



Input Set : N:\Crf3\RULE60\09965553.txt 
Output Set: N:\CRF3\12062001\I965553.raw 

397 ccatatgcgc atctagcctc atacatagag atatgcagca ccgttaaaat cgccggagtt 360 

399 ccaaaagatg cgatactcct taacctcttt tccttttccc tagcaggaga ggcaaaaaga 420 

401 tggttgcact cctttaaagg caatagctta agaacatggg aagaagtagt ggaaaaattc 480 

403 ttaaagaagt atttcccaga gtcaaagacc gtcgaacgaa agatggagat ttcttatttc 540 

405 catcaatttc tggatgaatc ccttagcgaa gcactagacc atttccacgg attgctaaga 600 

407 aaaacaccaa cacacagata cagcgagcca gtacaactaa acatattcat cgatgacttg 660 

409 caactcttaa tcgaaacagc tactagaggg aagatcaagc tgaagactcc cgaagaagcg 720 

411 atggagctcg tcgagaacat ggcggctagc gatcaagcaa tccttcatga tcacacttat 780 

413 gttcccacaa aaagaagcct cttggagctt agcacgcagg acgcaacttt ggtacaaaac 840 

415 aagctgttga cgaggcagat agaagccctc atcgaaaccc tcagcaagct gcctcaacaa 900 

417 ttacaagcga taagttcttc ccactcttct gttttgcagg tagaagaatg ccccacatgc 960 

419 agagggacac atgagcctgg acaatgtgca agccaacaag acccctctcg tgaagtaaat 1020 

421 tatataggca tactaaatcg ttacggattt cagggctaca accagggaaa tccatctgga 1080 

423 ttcaatcaag gggcaacaag atttaatcac gagccaccgg ggtttaatca aggaagaaac 1140 

425 ttcatgcaag gctcaagttg gacgaataaa ggaaatcaat ataaggagca aaggaaccaa 1200 

427 ccaccatacc agccaccata ccagcaccct agccaaggtc cgaatcagca agaaaagccc 1260 

429 accaaaatag aggaactgct gctgcaattc atcaaggaga caagatcaca tcaaaagagc 1320 

4 31 acggatgcag ccattcggaa tctagaagtt caaatgggcc aactggcgca tgacaaagcc 1380 

433 gaacggccca ctagaacttt cggtgctaac atggagagaa gaaccccaag gaaggataaa 1440 

435 gcagtactga ctagagggca gagaagagcg caggaggagg gtaaggttga aggagaagac 1500 

437 tggccagaag aaggaaggac agagaagaca gaagaagaag agaaggtggc agaagaacct 1560 
439 aagcgtacca agagccagag agcaagggaa gccaag 1596 
443 <210> SEQ ID NO : 8 
445 <211> LENGTH: 532 
447 <212> TYPE: PRT 

449 <213> ORGANISM: Artificial Sequence 
4 53 <220> FEATURE: 

455 <223> OTHER INFORMATION: Description of Artificial Sequence: plant 
4 57 retroelement sequence 

461 <400> SEQUENCE: 8 

463 Met Arg Gly Arg Thr Ala Ser Gly Asp Val Val Pro lie Asn Leu Glu 

465 1 5, 10 15 

469 lie Glu Ala Thr Cys Arg Arg Asn Asn Ala Ala Arg Arg Arg Arg Glu 

471 20 25 30 . 

475 Gin Asp lie Glu Gly Ser Ser Tyr Thr Ser Pro Pro Pro Ser Pro Asn 

477 35 40 45 

4 81 Tyr Ala Gin Met Asp Gly Glu Pro Ala Gin Arg Val Thr Leu Glu Asp 

483 50 55 60 

4 87 Phe Ser Asn Thr Thr Thr Pro Gin Phe Phe Thr Ser lie Thr Arg Pro 

489 65 70 75 80 

493 Glu Val Gin Ala Asp Leu Leu Thr Gin Gly Asn Leu Phe His Gly Leu 

495 85 90 95 

499 Pro Asn Glu Asp Pro Tyr Ala His Leu Ala Ser Tyr lie Glu lie Cys 

501 100 105 110 

505 Ser Thr Val Lys lie, Ala Gly Val Pro Lys Asp Ala lie Leu Leu Asn 

507 115 120 125 

511 Leu Phe Ser Phe Ser Leu Ala Gly Glu Ala Lys Arg Trp Leu His Ser 

513 130 135 140 

517 Phe Lys Gly Asn Ser Leu Arg Thr Trp Glu Glu Val Val Glu Lys Phe 
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519 


145 










150 










155 










160 


523 


Leu 


Lys 


Lys 


Tyr 


Phe 


Pro 


Glu 


Ser 


Lys 


Thr 


Val 


Glu 


Arg 


Lys 


Met 


Glu 


525 










165 










170 










175 




529 


He 


Ser 


Tyr 


Phe 


His 


Gin 


Phe 


Leu 


Asp 


Glu 


Ser 


Leu 


Ser 


Glu 


Ala 


Leu 


531 








180 










185 










190 






535 


Asp 


His 


Phe 


His 


Gly 


Leu 


Leu 


Arg 


Lys 


Thr 


Pro 


Thr 


His 


Arg 


Tyr 


Ser 


537 






195 










200 










205 








541 


Glu 


Pro 


Val 


Gin 


Leu 


Asn 


He 


Phe 


He 


Asp 


Asp 


Leu 


Gin 


Leu 


Leu 


He 


543 




210 










215 










220 










547 


Glu 


Thr 


Ala 


Thr 


Arg 


Gly 


Lys 


He 


Lys 


Leu 


Lys 


Thr 


Pro 


Glu 


Glu 


Ala 


549 


225 










230 










235 










240 


553 


Met 


Glu 


Leu 


Val 


Glu 


Asn 


Met 


Ala 


Ala 


Ser 


Asp 


Gin 


Ala 


He 


Leu 


His 


555 










245 










250 










255 




559 


Asp 


His 


Thr 


Tyr 


Val 


Pro 


Thr 


Lys 


Arg 


Ser 


Leu 


Leu 


Glu 


Leu 


Ser 


Thr 


561 








260 










265 










270 






565 


Gin 


Asp 


Ala 


Thr 


Leu 


Val 


Gin 


Asn 


Lys 


Leu 


Leu 


Thr 


Arg 


Gin 


He 


Glu 


567 






275 










280 










285 








571 


Ala 


Leu 


He 


Glu 


Thr 


Leu 


Ser 


Lys 


Leu 


Pro 


Gin 


Gin 


Leu 


Gin 


Ala 


He 


573 




290 










295 










300 










577 


Ser 


Ser 


Ser 


His 


Ser 


Ser 


Val 


Leu 


Gin 


Val 


Glu 


Glu 


Cys 


Pro 


Thr 


Cys 


579 


305 










310 










315 










320 


583 


Arg 


Gly 


Thr 


His 


Glu 


Pro Gly 


Gin 


Cys 


Ala 


Ser 


Gin 


Gin 


Asp 


Pro 


Ser 


585 










325 










330 










335 




589 


Arg 


Glu 


Val 


Asn 


Tyr 


He 


Gly 


He 


Leu 


Asn 


Arg 


Tyr 


Gly 


Phe 


Gin 


Gly 


591 








340 










345 










350 






595 


Tyr 


Asn 


Gin 


Gly 


Asn 


Pro 


Ser 


Gly 


Phe 


Asn 


Gin 


Gly 


Ala 


Thr 


Arg 


Phe 


597 






355 










360 










365 








601 


Asn 


His 


Glu 


Pro 


Pro 


Gly 


Phe 


Asn 


Gin 


Gly 


Arg 


Asn 


Phe 


Met 


Gin 


Gly 


603 




370 










375 










380 










607 


Ser 


Ser 


Trp 


Thr 


Asn 


Lys 


Gly 


Asn 


Gin 


Tyr 


Lys 


Glu 


Gin 


Arg 


Asn 


Gin 


609 


385 










390 










395 










400 


613 


Pro 


Pro 


Tyr 


Gin 


Pro 


Pro 


Tyr 


Gin 


His 


Pro 


Ser 


Gin 


Gly 


Pro 


Asn 


Gin 


615 










405 










410 










415 




619 


Gin 


Glu 


Lys 


Pro 


Thr 


Lys 


He 


Glu 


Glu 


Leu 


Leu 


Leu 


Gin 


Phe 


He 


Lys 


621 








420 










425 










430 






625 


Glu 


Thr 


Arg 


Ser 


His 


Gin 


Lys 


Ser 


Thr 


Asp 


Ala 


Ala 


He 


Arg 


Asn 


Leu 


627 






435* 










440 










445 








631 


Glu 


Val 


Gin 


Met 


Gly 


Gin 


Leu 


Ala 


His 


Asp 


Lys 


Ala 


Glu 


Arg 


Pro 


Thr 


633 




450 










455 










460 










637 


Arg 


Thr 


Phe 


Gly 


Ala 


Asn 


Met 


Glu 


Arg 


Arg 


Thr 


Pro 


Arg 


Lys 


Asp 


Lys 


639 


465 










470 










475 










480 


643 


Ala 


Val 


Leu 


Thr 


Arg 


Gly 


Gin 


Arg 


Arg 


Ala 


Gin 


Glu 


Glu 


Gly 


Lys 


Val 


645 










485 










490 










495 




649 


Glu 


Gly 


Glu 


Asp 


Trp 


Pro 


Glu 


Glu 


Gly 


Arg 


Thr 


Glu 


Lys 


Thr 


Glu 


Glu 


651 








500 










505 










510 






655 


Glu 


Glu 


Lys 


Val 


Ala 


Glu 


Glu 


Pro 


Lys 


Arg 


Thr 


Lys 


Ser 


Gin 


Arg 


Ala 


657 






515 










520 










525 








661 


Arg 


Glu 


Ala 


Lys 


























663 




530 
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